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The present invention relates to an improved barstar gene and improved 
barstar protein that can be used to neutralize the activity of a bamase in eucaryotic 
cells, particularly in plant cells. Thus the improved barstar gene can be used to 
produce fertility restorer plants capable of restoring the fertility to a line of male-sterile 
plants that contain in the nuclear genome of their cells a chimeric gene comprising 
a stamen-selective promoter and a DNA coding for a bamase. The present invention 
also relates to the restorer plants that contain in the nuclear genome of their cells the 
improved barstar gene. 
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Background to the I nvention 

In many, if not most plant species, the development of hybrid cultivars is highly* 

15 desired because of their generally increased productivity due to heterosis: the 
superiority of performance of hybrid individuals compared with their parents (see e.g. 
Fehr, 1987, Principles of cultivar development, Volume 1 : Theory and Technique, 
MacMillan Publishing Company, New York; Allard, 1960, Principles of Plant Breeding, 
John Wiley and Sons, Inc.). 

20 The development of hybrid cultivars of various plant species depends upon the 

capability to achieve almost complete cross-pollination between parents. This is most 
simply achieved by rendering one of the parent lines male sterile (i.e. bringing them 
in a condition so that pollen is absent or nonfunctional) either manually, by removing 
the anthers, or genetically by using, in the one parent, cytoplasmic or nuclear genes 

25 that prevent anther and/or pollen development (for a review of the genetics of male 
sterility in plants see Kaul, 1988, 'Male Sterility in Higher Plants', Springer Verlag). 

For hybrid plants where the seed is the harvested product (e.g. com, oilseed 
rape) it is in most cases also necessary to ensure that fertility of the hybrid plants is 
fully restored. In systems in which the male sterility is under genetic control this 

30 requires the existence and use of genes that can restore male fertility. The 
development of hybrid cultivars is mainly dependent on the availability of suitable and 
effective sterility and restorer genes. 
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Endogenous nuclear loci are known for most plant species that may contain 
genotypes which affect male sterility, and generally, such loci need to be 
homozygous for particular recessive alleles in order to result in a male-sterile 
phenotype. The presence of a dominant 'male fertile* allele at such loci results in male 
5 fertility. 

Recently it has been shown that male sterility can be induced in a plant by 
providing the genome of the plant with a chimeric male-sterility gene comprising a 
DNA sequence (or male-sterility DNA) coding, for example, for a cytotoxic product 
(such as an RNase) and under the control of a promoter which is predominantly 
10 active in selected cells of the male reproductive organs. In this regard stamen- 
selective promoters, such as the promoter of the TA29 gene of Nicotians tabacum, 

0 have been shown to be particularly useful for this purpose (Mariani et al., 1990, 

S| Nature 347:737, European patent publication ("EP") 0,344,029). By providing the 
nuclear genome of the plant with such a male-sterility gene, an artificial male-sterility 

vEl- 15 locus is created containing the artificial male-sterility genotype that results in a male- 

y= sterile plant. 

f In addition it has been shown that male fertility can be restored to the plant 

f:| with a chimeric fertility-restorer gene comprising another DNA sequence (or fertility- 
\$- restorer DNA) that codes, for example, for a protein that inhibits the activity of the 
P 20 cytotoxic product or otherwise prevents the cytotoxic product to be active in the plant 
cells (European patent publication "EP" 0,412,91 1). For example the bamase gene 
of Bacillus amyloliquefaciens codes for an RNase, the bamase, which can be 
inhibited by a protein, the barstar, that is encoded by the barstar gene of EL 
amyloliquefaciens . The bamase gene can be used for the construction of a sterility 
25 gene while the barstar gene can be used for the construction of a fertility-restorer 
gene. Experiments in different plant species, e.g. oilseed rape, have shown that a 
chimeric barstar gene can fully restore the male fertility of male sterile lines in which 
the male sterility was due to the presence of a chimeric bamase gene (EP 0,412,91 1 , 
Mariani et al., 1991 , Proceedings of the CCIRC Rapeseed Congress, July 9-1 1 , 1991 f 
30 Saskatoon, Saskatchewan, Canada; Mariani et al., 1992, Nature 357:384). By 
coupling a marker gene, such as a dominant herbicide resistance gene (for example 
the bar gene coding for phosphinothricin acetyl transferase (PAT) that converts the 
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herbicidal phosphinothricin to a non-toxic compound [De Block et al., 1987, EMBO 
J. 6:2513]), to the chimeric male-sterility and/or fertility-restorer gene, breeding 

} systems can be implemented to select for uniform populations of male sterile plants 
(EP 0,344,029; EP 0,412,911). 
5 The production of hybrid seed of any particular cultivar of a plant species 

requires the: 1) maintenance of small quantities of pure seed of each inbred par nt, 
and 2) the preparation of larger quantities of seed of each inbred parent. Such larger 
quantities of seed would normally be obtained by several (usually two) seed 
multiplication rounds, starting from a small quantity of pure seed ("basic seed") and 

10 leading, in each multiplication round, to a larger quantity of pure seed of the inbred 
parent and then finally to a stock of seed of the inbred parent (the "parent seed" or 
"foundation seed") which is of sufficient quantity to be planted to produce the desired 
quantities of hybrid seed. Of course, in each seed multiplication round larger planting 
areas (fields) are required. 

15 



Bamase is the ribonuclease which is secreted by Bacillus amyloliquefaciens 
and barstar is the inhibitor of bamase that is produced by the same microorganism 
(Hartley, 1988, J.Mol.Biol. 202:913-915). Several mutant bamase and barstar 
proteins have been described (Hartley, 1993, Biochemistry 32:5978-5984; Schreiber 
20 and Fersht, 1993, Biochemistry 32:5145-5150; Guillet et al, 1993, Current Biology 
1 : 165-1 77; Hartley, 1989, TIBS 14:450-454; Axe et al, 1996, PNAS 93:5590-5594; 
Serrano, 1993, J.Mol.Biol. 233:305-312). 

Some of these mutants were snWn to essentially retain the biological activity of the 
^ barnase and barstar as produceaSby Bacillus amyloliguefaciens . However, at least 
25 two mutant barstars have been described that have no detectable barstar activity 
(Hartley, 1 993, Biochemistry 32:5978-5^84; Guillet et al, 1 993, Current Biology 1 :1 65- 
177). 

Also other related microorganisms are known to produce proteins that are highly 
similar to barnase and barstar. Thus Bacillus intermedius produces binase and 
30 binstar (Schulga et al, 1992, NAR 20:2375; Guillet et al, 1993, supra ). 
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Summary of the Invention 

The present invention provides improved barstar DNAs that encode a barstar 
with an amino acid sequence that starts with Met-Xaa wherein Xaa is Alanine, Valine, 
5 Glycine, Aspartic acickor Glutamic acid. Preferably the barstar DNAs encode barstar 
having an amino acid sequence which is 1) the amino acid sequence of SEQ ID No 
1 2 in which the second \mino acid is not Lysine but is Alanine, Valine, Glycine, 
Aspartic acid or Glutamic gpid; 2) the amino acid sequence of SEQ ID No 4; or the 
amino acid sequence of SEQ, ID No 4 in which the second amino acid is not Alanine, 
10 but is Valine, Glycine, Asparti\acid or Glutamic acid. 



20 



The present invention further provides improved synthetic barstar DNAs that 
contain less than 40% A and T nucleotides and/or that have a codon usage that is 
optimized for oilseed rape, cotton, maize, rice and wheat, preferably for oilseed rape, 
. 1 5 maize and rice. Preferably the synthetic barstar DNAs contain no more than 7% CG 
dinucleotides and/or no more than 9.5% of CNG trinucleotides. A preferred synthetic 
barstar DNA encodes a barstar having the amino acid sequence of SEQ ID No. 4. A 
particularly preferred synthetic barstar DNA has the nucleotide sequence of SEQ ID 
No 3. 



The present invention also provides: chimeric genes in which the improved 
barstar DNAs are operably linked to a- plant expressible promoter, preferably a 
promoter that directs expression selectively in stamen cells and that directs 
expression at least in tapetum cells; plant cells and plants comprising these chimeric 
25 genes. 

The present invention further provides uses of the improved barstar DNAs and 
imporved barstar proteins to neutralize bamases in plant cells, particularly with regard 
to restoration of male fertility to male-sterile lines. 
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The present invention also provides improved barstars having an amino acid 
sequence that starts with Met-Xaa where Xaa is Alanine, Valine, Glycine, Aspartic 
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Detailed Description of the Invention 

A male-sterile ("ms") plant as used herein is a plant of a given plant species 
which is male-sterile due to expression of a chimeric male-sterility gene (S), 
integrated in the nuclear DNA of that plant and comprising the following operably 
linked DNA fragments: 

1) A "sterility promoter" which directs expression selectively in stamen 
cells, and preferably at least in tapetum cells, and, 

2) A "male-sterility DNA" coding for a bamase (the "barnase DNA"). 

An example of such a male-sterility gene is a gene comprising a bamase DNA under 
control of the promoter of the TA29 gene from tobacco, as for instance contained in 
plasmid pVE108 (WO 92/29696). 

A restorer plant as used herein is a male-fertile plant of the same plant species that 
contains integrated in the nuclear DNA of its cells a fertility restorer gene (R) 
comprising: 

1 ) A "restorer promoter" which directs expression at least in those stamen 
cells in which the sterility promoter directs expression of the bamase DNA in 
the ms plant, and, 

2) A "restorer DNA" coding for a barstar (the "barstar DNA"). 

In the restorer plants of this invention the barstar DNA is the improved barstar DNA 
as described below. 

The presence of the fertility restorer gene in those progeny plants of a male-sterile 
plant that contain the male-sterility gene, restores the male fertility in those progeny 
plants. The progeny plants are of course obtained from a cross between a male- 
sterile plant and the restorer plant. 

A restored plant as used herein is a plant (of the same species as the male-sterile, 
plant and the restorer plant) that is male-fertile and that contains within its genome 
the male-sterility gene and the fertility-restorer gene, particularly the fertility restorer 
gene comprising the improved barstar DNA of this invention. 

A line is the progeny of a given individual plant. The plants of a given line 
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resemble each other in one or more particular genetic and/or phenotypic 
characteristics. As used herein a male-sterile (ms) line is a group of plants of a plant 
species, particularly of a plant variety, which are all male-sterile due to the presence 
of a particular male-sterility gene at the same genetic locus. Similarly, a restorer line 
5 is a group of plants of a plant species which all contain the particular fertility restorer 
gene at the same genetic locus. Preferably all the plants of a ms line (respectively a 
restorer line) have the same genotype with respect to the male-sterility locus 
(respectively the fertility restorer locus). 

Male fertility is restored to a male-sterile line by introducing the fertility restorer gene 
1 0 in the ms line, e.g. by crossing of the plants of the ms line with plants of the restor r 

line so that at least some of the progeny plants will be restored plants. 

The genetic background of a line of a variety designates the totality of genes 

present in the variety that determine the particular phenotypic characteristics of that 

variety. A foreign gene, such as a male-sterility gene or a restorer gene, introduced 
15 by genetic engineering to produce a particular line of a variety can thus be introduced 

in different varieties (or even in different species), each having a different genetic 

background. 

For the production of hybrid seed the male-sterile line is also called the female or first 
parent line, and the male-fertile (restorer) line is also called the male or second 
20 parent line. 

A gene as used herein is generally understood to comprise at least one coding 
region coding for an RNA, protein or polypeptide which is operably linked to suitable 
promoter and 3' regulatory sequences. 
25 For the purpose of this invention the expression of a gene, such as a chimeric 

gene, will mean that the promoter of the gene directs transcription of a DNA into a 
RNA which is biologically active i.e. which is either capable of interacting with anoth r 
RNA or protein, or which is capable of being translated into a biologically active 
polypeptide or protein. 

30 

The phenotype is the external appearance of the expression (or lack of 
expression) of a genotype i.e. of a gene or set of genes (e.g. male-sterility, presence 
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of protein or RNA in specific plant tissues etc.). 

A bamase as used herein is any protein which is capable of degrading single- 
stranded RNA and which comprises the amino acid sequence of bamase (secret d 
barnase) as secreted by Bacillus amyloliquefaciens (Hartley, 1988, J.Mol. Biol. 
202:913) or an amino acid sequence having at least 80 %, preferably at least 85% 
sequence identity with this sequence. Barnases, as used herein are capable of 
degrading RNA by a reaction which involves the initial cleaving of the phosphodiester 
bond between the 5' ribose of one nucleotide and the phosphate group attached to 
the neighbouring 3' nucleotide. The initial product of this reaction is a 2\3'-cyclic 
phosphate intermediate which is subsequently hydrolyzed to the corresponding 3' 
nucleoside phosphate. Barnases are also capable of hydrolyzing 
polyethenoadenosine phosphate to yield a highly fluorogenic nucleotide analogue 
1,N-ethenoadenosine (Fitzgerald and Hartley, 1993, AnaLBiochem. 214:544-547) 
and have at least 10%, preferably at least 50%, particularly at least 75% of the 
activity of secreted bamase as measured under standard conditions (Fitzgerald and 
Hartley* 1993, AnaLBiochem. 214:544-547; Hartley, 1993, Biochemistry 
32:5978:5984). Barnases are further capable of specific binding to wild-type barstar 
(see below) with a dissociation constant of 10 12 M or less, preferably with a 
dissociation constant of the order of 10 " 13 M to 10 * 14 M (Schreiber and Fersht, 1993, 
Biochemistry 32:5145-5150; Hartley, 1993, Biochemistry 32:5978-5984). 
Binase is the extracellular ribonuclease secreted bv Bacillus intermedius (Schulga et 
al, 1992, NAR 20:2375) and is also considered to be a bamase as used in this 
invention. 

For convenience bamase, as used in the description or in the Examples below, will 
designate a protein having the amino acid sequence of the bamase encoded by 
pVE1 08 (WO 92/09696). 

A barstar is any protein that is capable of specific binding to secreted bamase 
with a dissociation constant of 10" 12 M or less, preferably with a dissociation constant 
of the order of 10" 13 M to 10 * 14 M (Schreiber and Fersht, 1993, Biochemistry 32:5145- 
5150; Hartley, 1993, Biochemistry 32:5978-5984). Barstars are capable of inhibiting 
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at least 50%, particularly at least 75%, more particularly at least 90% of the activity 
of secreted bamase in an equimolar mixture of barstar and secreted bamase in 
standard conditions (Hartley, 1993, Biochemistry 32:5978-5984). A barstar is a 
protein comprising the amino acid sequence of SEQ ID No 2 or an amino acid 
sequence having at least 80 %, preferably at least 85% sequence identity with this 
sequence. Wild type barstar is the barstar produced by Bacillus amvloliquefaciens 
and having the amino acid sequence of SEQ ID No 2 (see also Hartley, J.MoI. Biol. 
1988 202:913). It goes without saying that barstars as used herein include for 
example the biologically active barstar mutants described by Hartley (1993, 
Biochemistry 32:5978-5984). 

A barnase DNA (or barnase coding sequence) as used herein is any DNA 
fragment having a nucleotide sequence coding for a bamase. A particularly preferred 
bamase DNA is the bamase DNA as present in pVE108 (WO 92/09696). A bamase 
gene is a plant-expressible chimeric gene comprising a bamase DNA operably linked 
to suitable 5 1 and 3* regulatory regions, i.e. a promoter region comprising a promoter 
recognized by the polymerases of a plant cell and a 3' region comprising a plant 
polyadenylation site. 

A barstar DNA (or barstar coding sequence) as used herein is any DNA 
fragment having a nucleotide sequence coding for a barstar. A wild type barstar DNA 
is the DNA which codes for wild-type barstar and which has the nucleotide sequence 
of SEQ ID No 1. (Hartley, J.MoI. Biol. 1988 202:913). A barstar gene is a plant- 
expressible chimeric gene comprising a barstar DNA operably linked to suitable 5' 
and 3' regulatory regions, i.e. a promoter region comprising a promoter recognized 
by the polymerases of a plant cell and a 3' region comprising a plant polyadenylation 
site. 

As used herein, a genetic locus is the position of a given gene in the nuclear 
genome, i.e. in a particular chromosome, of a plant. Two loci can be on different 
chromosomes and will segregate independently. Two loci can be located on th 
same chromosome and are then generally considered as being linked (unless 
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sufficient recombination can occur between them). 

An endogenous locus is a locus which is naturally present in a plant. A for ign locus 
is a locus which is formed in the plant because of the introduction, by means of 
genetic transformation, of a foreign DNA. 
5 In diploid plants, as in any other diploid organisms, two copies of a gene are present 
at any autosomal locus. Any gene can be present in the nuclear genome in several 
variant states designated as alleles. If two identical alleles are present at a locus that 
locus is designated as being homozygous, if different alleles are present, the locus 
is designated as being heterozygous. The allelic composition of a locus, or a set of 

10 loci, is the genotype. Any allele at a locus is generally represented by a separate 
symbol (e.g. R and S and - representing the absence of the gene). A foreign 
locus is generally characterized by the presence and/or absence of a foreign DNA. 
A dominant allele is generally represented by a capital letter and is usually associated 
with the presence of a biologically active gene product (e.g. a protein) and an 

15 observable phenotypic effect (e.g. R indicates the production of an active barstar 
protein). 

A plant can be genetically characterized by identification of the allelic state of at least 
one genetic locus. 

20 The genotype of any given locus can be designated by the symbols for the two 

alleles that are present at the locus (e.g. R/R or S/-). The genotype of two unlinked 
loci can be represented as a sequence of the genotype of each locus (e.g. S/-,R/-) 



A male sterite plant as used herein, contains a foreign "male-sterility locus" 
*n£5 which contains the nrale-sterility gene S which when expressed in cells of the plant 
make the plant male-srerile without otherwise substantially affecting the growth and 
development of the plant. 

The male-sterility locus preferably also comprises in the same genetic locus 
at least one first marker gene which comprises at least: 
30 1) a first marker DNA encoding a first marker RNA, protein or polypeptide 

which, when present at least in a specific tissue or specific cells of the plant, 
renders the plant easily separable from other plants which do not contain the 



WO 98/10081 PCT/EP97/04739 

-10- 

first marker RNA, protein or polypeptide encoded by the first marker DNA at 
least in the specific tissue or specific cells, and, 

2) a first marker promoter capable of directing expression of the first marker 
DNA at least in the specific tissue or specific cells: the first marker DNA being 
in the same transcriptional unit as, and under the control of, the first mark r 
promoter. 

Such a male-sterility gene is always a dominant allele at such a foreign male- 
sterility locus. The recessive allele corresponds to the absence of the male-sterility 
gene in the nuclear genome of the plant. 

SteriliW promoters that can be used in the male-sterility genes in the first 
parent line ofViis invention have been described before (EP 0,344,029 and EP 
0,412,91 1). The sterility promoter can be any promoter but it should at least be active 
in stamen cells, pakicularly tapetum cells. Particularly useful sterility promoters are 
promoters that are selectively active in stamen cells, such as the tapetum-specific 
promoters of the TA29 gene of Nicotiana tabacum (EP 0,344,029) which can be used 
in tobacco, oilseed rape, Nettuce, cichory, com, rice, wheat and other plant species; 
the PT72, the PT42 and PE1 promoters from rice which can be used in rice, com, 
wheat, and other plant spefcies (WO 92/13956) ; the PCA55 promoter from com 
which can be used in com, ricev wheat and other plant species (WO 92/13957); and 
the A9 promoter of a tapetum-specific gene of Arabidopsrs thaliana (Wyatt et al M 
1 992, Plant Mol. Biol. 13:61 1 -92ZK 

The present invention is based on the finding that the "restorer capacity" (the 
ability and efficiency of a restorer line to restore male fertility efficiently to a wide 
variety of ms lines) is directly related to the amount of barstar produced in the 
stamen, particularly in the tapetum cells, of the restorer plants (and by implication in 
stamen, particularly the tapetum cells, of the restored plants). 
Restorer capacity is important because an efficient restorer line can be used for the 
restoration of male fertility to a wide variety of male-sterile lines, which may differ in 
the level to which bamase is produced (by expression of the male-sterility gene) in 
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the stamen, particularly in the tapetum cells. One source of such a variety of bamase 
production among different ms lines may be due to position effects, i.e. the variation 
in gene expression of the ms gene due to the different insertion places in the nuclear 
genome among different transformants. Another source of variation of bamase 
5 production among various ms lines that contain the ms gene at the same genetic 
locus is the different genetic backgrounds of the various ms lines. Thus when the ms 
gene from one male-sterile line of one variety of a plant species is introduced into 
other varieties of the same (or of a closely related) plant species by backcrossing to 
generate ms lines of those varieties the ms gene is introduced in a different genetic 

10 background which can influence the level at which the ms gene is expressed. The 
observed variation in barnase production, and the problems associated therewith with 
respect to fertility restoration, is further augmented when variation of expression of 
the restorer gene among different restorer plants is taken into account - such 
variation may originate from the same sources as indicated above, i.e. position 

15 effects of the fertility restorer gene in different restorer lines and/or the different 
genetic backgrounds of different restorer lines. Thus, in order to obtain restorer lines 
with good restorer capacity, it is desired to have plant-expressible chimeric barstar 
genes that are expressed at high levels to produce large amounts of barstar in 
stamen cells, particularly in anther cells, especially in tapetum cells of a plant. 



This invention thus provides improved barstar DNAs which, all other things 
being equal, are more efficiently expressed in plant cells - thereby producing in those 
cells higher levels of active barstar protein, particularly improved barstar protein - 
than wild-type barstar DNAs. Thus, improved fertility restorer genes are expressed, 

25 in plants of at least one plant species (and preferably in plants of several plant 
species, particularly several monocot plant species) at a level which is on the average 
higher than the level observed with similar restorer genes comprising wild-type 
barstar DNA. An average higher level of expression means that the amount of barstar 
produced in particular organs (e.g. anthers) of different restorer lines containing the 

30 fertility restorer gene of this invention at different genetic loci and/or within different 
genetic backgrounds is significantly higher than the amount of barstar produced in 
the stamen of different restorer lines containing similar fertility restorer genes 
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comprising wild-type barstar DNA at different genetic loci and within different gen tic 
background. 

Generally the invention provides "improved" barstar coding regions that have 
a nucleotide sequence containing as a second codon a codon that codes for an 
amino acid selected from the group of Valine (Val) t Alanine (Ala), Aspartic acid (Asp), 
Glutamic acid (Glu) and Glycine (Gly). It is particularly preferred that this second 
codon encodes Alanine. These codons start with a G which provides for an optimal 
translation initiation context at position +4 (i.e. the first nucleotide of the second 
codon of the barstar coding sequence). Thus the N-terminus of the barstar, encoded 
by the improved barstar DNA, consists of Met-Xaa in which Xaa is Ala, Val, Gly, Glu, 
or Asp, and is preferably Ala (more preferably an Ala encoded by a GCG codon). 
Preferably the improved barstar DNAs encode a barstar comprising the amino acid 
sequence of SEQ ID No 2 between amino acids residues 3 and 90. Examples of 
improved barstars DNAs are 1) a barstar DNA encoding a barstar having the 
sequence of SEQ ID No 2 in which the second amino acid (Lys) is replaced by Xaa 
as defined above, and 2) a barstar DNA encoding a barstar having the amino acid 
sequence of SEQ ID No 4. It was found that modifying the N-terminus of the barstar 
protein in a nonconservative way did not negatively affect the specific biological 
activity of barstar. In fact genes comprising these improved barstar DNAs, when 
expressed in plants, particularly when expressed in monocots (such as com, rice and 
wheat), generally produced more barstar protein as assessed by Western blotting 
of stamen tissues. This increase in barstar production in tapetum cells may be due 
to improved translation but also to improved stability of the barstar protein. 

It was also found that modifying a barstar coding sequence to reduce the %AT 
of this sequence below 40% resulted in an improved barstar DNA that could be used 
to considerably increase the level of production of protein with barstar activity in plant 
cells, particularly in tapetum cells. Such improved barstar coding sequences will 
herein be generally designated as "synthetic" barstar coding sequences. 

It is preferred that the synthetic barstar DNAs have a codon usage which is 
preferred in the majority of plant species in which it is intended to be used, e.g. in 



WO 98/10081 PCT/EP97/04739 

-13- 

restorer lines. A preferred codon usage of a barstar DNA for a majority of N plant 
species X1, X2, .... XN means that for each amino acid, for which more than one 
codon xists, the most frequent codon for that amino acid in the synthetic barstar 
DNA (the "barstar codon" for that amino acid), preferably every codon for that amino 
5 acid, is a codon that in each of more than N/2 plant species has an overall frequency 
that is 1) at least twice the overall frequency of the least used codon and/or 2) more 
than half of the overall frequency of the most used codon. 

It is also preferred that for at least 17, preferably for at least 1 8 of the 1 9 amino acids 
for which multiple codons exist, the most frequently used codon, preferably every 
10 codon, in the synthetic barstar DNA is a codon that in each of the N plant species 
has an overall frequency that is 1) at least twice the overall frequency of the least 
used codon and/or 2) more than half of the overall frequency of the most used codon. 
*jjf Example 2 describes the design of a particular synthetic barstar DNA with optimized 

codon usage with respect to five plant species (N=5) i.e. oilseed rape, cotton, com, 
1 5 wheat and rice. 

For the purpose of this invention the overall codon frequencies for various plant 
species as published by Ikemura are used (Ikemura, 1993, In "Plant Molecular 
Biology Labfax", Croy, ed., Bios Scientific Publishers Ltd., pp. 37-48). 
Preferred plant species in which the synthetic barstar DNAs of this invention are 
20 used, e.g. in restorer lines, are oilseed rape, cotton, maize, rice, and wheat, 
particularly oilseed rape, maize and rice, especially oilseed rape and maize. 

The synthetic barstar DNAs are preferably further characterized by having CG 
dinucleotides and CNG trinucleotides, wich are targets for methylation in plant c lis, 
at low frequencies. Thus the synthetic barstar DNAs of this invention are 
25 characterized by: 

having not more that 7% of CG dinucleotides, preferably having not 
more than 6 % of CG dinucleotides, particularly having between 5.5 
and 6% CG dinucleotides (which is about 1 5 or 1 6 CG nucleotides in 
a coding sequence of 270-273 bp), and/or, 
30 - having not more than 9.5% CNG trinucleotides (where N is any 

nucleotide), preferably having not more than 9% CNG trinucleotides, 
particularly having between 8.5 and 9% CNG trinucleotides (which is 
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about 23-25 CNG trinucleotides in a coding sequence of 270-273 bp). 
The synthetic barstar DNAs of this invention are further preferably 
characterized by having no more than 7, preferably no more than 5, particularly no 
more than 3 tetranucleotides consisting of only one kind of nucleotide (i.e. AAAA, 
5 CCCC, GGGG, lilt) and having no more than 2, preferably no more than 1 
pentanucleotide consisting of only one kind of nucleotide (i.e. AAAAA, CCCCC, 
GGGGG, I I I I I). 



Synthetic barstar DNAs of this invention include those coding for wild-type 
10 barstar (SEQ ID No 2), e.g. a barstar DNA having the nucleotide sequence of SEQ 
ID No 3 between positions 7 and 273, preceded by ATG. However, preferred 
synthetic barstar DNAs of this invention are synthetic barstar coding sequences that 
^ encode a barstar having an amino acid sequence starting with Met-Xaa wherein Xaa 

is Valine, Alanine, Aspartic acid, Glutamic acid or Glycine, and preferably wherein 
15 Xaa is Alanine. Preferred synthetic barstar DNAs are DNAs having the nucleotide 
sequence of SEQ ID No 3 between positions 7 and 273, preceded by ATGGNN, 
preferably by ATGGCN (where N is any nucleotide). A particularly preferred synthetic 
barstar DNA is a DNA having the nucleotide sequence of SEQ ID No 3. 
(ft Preferably, the synthetic barstar coding sequence and the wild type barstar 

12 20 coding sequence code for barstar proteins having at least 80% sequence identity. 

For the purpose of this invention the % sequence identity of two related 
nucleotide sequences (e.g. two barstar DNAs) or amino acid sequences (i.e. two 
barstars) refers to the number of positions in the two optimally aligned sequenc s 
25 which have identical residues (x1 00) divided by the number of positions compar d. 
A gap, i.e. a position in an alignment where a residue is present in one sequence but 
not in the other is regarded as a position with non-identical residues. 



9 



30 



Restorer genes comprising the synthetic barstar DNAs of this invention may 
be used for producing restorer lines in different plant species but are particularly 
useful for producing restorer lines in cereals, especially in corn, rice and wheat. 
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This invention thus also provides improved barstar proteins which have an N- 
terminus which consists of Met-Xaa (where Xaa is as defined above). Particularly 
preferred improved barstars of this invention comprise the sequence of SEQ ID No 
2 between residues 3 and 90, and are, for example: 1) a barstar having the amino 
acid sequence of SEQ ID No 4, and 2) a barstar having the amino acid sequence of 
SEQ ID No. 2 wherein the second amino acid (Lys) is replaced by Xaa as defined 
above. 

Furthermore it goes without saying that the region of these preferred improved 
barstars that corresponds to the sequence beween residues 3 and 90 of SEQ ID No 
2 may be modified, as long as the overall amino acid sequence has at least 80%, 
preferably at least 85%, particularly at least 90% sequence similarity with SEQ ID No 
2, and as long as the improved barstar is capable of inhibiting at least 50%, 
preferably at least 75%, particularly at least 90% of the activity of secreted barnase 
under standard conditions. 

As indicated above the modification at the N-terminus of a barstar protein (i.e. 
the introduction of an N-terminus consisting of Met-Xaa-...) is a nonconservative 
modification, which however does not negatively affect the specific biological activity 
of the improved barstars of this invention when they are produced in plant cells. This 
is exemplified for the improved barstar of SEQ ID No 4 in example 6. 

This invention thus also provides fertility restorer genes which are 
characterized in that they contain the improved and/or synthetic barstar DNAs of this 
invention, and which can be used to produce improved transgenic restorer plants of 
a plant species, i.e. restorer plants with good restorer capacity. It goes without saying 
that the barstar DNA in the fertility restorer gene may be translationally fused to other 
coding sequences so that it will be expressed as part of a fusion protein. 

In principle any promoter can be used as a restorer promoter in the fertility 
restorer gene in the restorer plant of this invention. The only prerequisite is that such 
restorer plant which contains the fertility-restorer gene, is capable of restoring th 
fertility to a male-sterile line, i.e. of producing restored plants (comprising both the 
male-sterility gene and the fertility restorer gene) that are phenotypically normal and 
male-fertile. This requires that the restorer promoter in the fertility-restor r gene 
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should be at least active in those stamen cells of a plant in which the sterility 
promoter of the corresponding male-sterility gene can direct expression of the 
barnase DNA. In this regard it will be preferred that the sterility promoter and the 
restorer promoter are the same (e.g. both TA29 promoter or both CA55 promot r). 
However, the sterility promoter may be active only in stamen cells while the restorer 
promoter is also active in other cells. For instance, the sterility promoter can be a 
stamen-selective (such as the TA29 or CA55 promoter) while the restorer promoter 
is a constitutive promoter such as the 35S-tap promoter which is a 35S promoter that 
is modified to be active in tapetum cells (van der Meer et al ( 1992, the Plant Cell 
4:253-262). 

Of course, the improved restorer DNAs of this invention may also be used for 
other purposes than restoration of male fertility in plants. In this regard the improved 
and/or synthetic barstar DNAs of this invention may be used in any circumstance 
where neutralizing the activity of barnase in plant cells may be useful, such as for 
example described in EP 0,412,911, WO 93/19188, WO 92/21757, WO 93/25695 
and WO 95/02157. For these uses the barstar DNA may be placed under the 
transcriptional control of other promoters that are more suitable, and promoters like 
the 35S promoter or the promoter of the nopaline synthase gene of Aorobacterium 
T-DNA may be used. Minimal promoters (i.e. plant promoters essentially containing 
only a TATA box without any other regulatory enhancer elements) may also be us d 
and the barstar DNAs of this invention may even be used without any promoter at all 
(for instance when it is intended that the barstar DNA is transcribed under the control 
of an endogenous promoter in transformed plant cells). 

The fertility restorer gene R as used in the restorer plant preferably also 
comprises at least a second marker gene which comprises at least: 

1) a second marker DNA encoding a second marker RNA, protein or 
polypeptide which, when present at least in a specific tissue or specific cells 
of the plant, renders the plant easily separable from other plants which do not 
contain the second marker RNA, protein or polypeptide encoded by the 
second marker DNA at least in the specific tissue or specific cells, and, 
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2) a second marker promoter capable of directing expression of the second 
marker DNA at least in the specific tissue or specific cells: the second marker 
DNA being in the same transcriptional unit as, and under the control of, the 
second marker promoter. 

5 

Thus a restorer plant of this invention contains a foreign "restorer locus" which 
contains the restorer gene R comprising the improved restorer DNA of this invention. 

The restorer locus preferably also comprises in the same genetic locus at least 
one second marker gene. 

10 

Preferred restorer plants of this invention are monocot restorer plants, 
preferably corn, rice or wheat plants, that produce, on the average, at least 10 ng, 
preferably at least 20 ng, particularly at least 40 ng (and up to 100 to 200 ng) barstar 
per mg total protein extracted from their isolated inflorescences (e.g. panicles in rice 
1 5 and wheat, tassels in corn - see e.g. Example 4). 

Especially preferred restorer plants of this invention will produce the improved 
barstars of this invention, particularly barstar having the amino acid sequence of SEQ 
ID No. 4. 



20 First and second marker DNAs and first and second marker promoters that 

can be used in the first and second marker genes of this invention are also well 
known (EP 0,344,029; EP 0,412,91 1). In this regard it is preferred that the first and 
second marker DNA are different, although the first and second marker promoter may 
be the same. 

25 

Foreign DNA such as the fertility-restorer gene, the male-sterility gene, or the 
first or second marker gene preferably also are provided with suitable 3' transcription 
regulation sequences and polyadenylation signals, downstream (i.e. 3') from their 
coding sequence i.e. respectively the fertility-restorer DNA, the male-sterility DNA, or 
30 the first or second marker DNA. In this regard foreign transcription 3' end formation 
and polyadenylation signals suitable for obtaining expression of the chimeric g ne 
can be used. For example, the foreign 3' untranslated ends of genes, such as gene 
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7 (Velten and Schell (1985) Nucl. Acids Res. 13:6998), the octopine synthase gene 
(De Greve et al., 1 982, J.MoI. Appl. Genet. 1 :499; Gielen et al (1 983) EMBO J. 3:835; 
Ingelbrecht et al., 1989, The Plant Cell 1 :671) and the nopaline synthase gene of the 
T-DNA region of Aarobacterium tumefaciens Ti-plasmid (De Picker et aL, 1982, 
5 J.Mol. Appl. Genet. 1:561), or the chalcone synthase gene (Sommer and Saedler, 
1 986, Mol.Gen.Genet. 202:429-434), or the CaMV 1 9S/35S transcription unit (Mog n 
et al., 1990, The Plant Cell 2:1261-1272) can be used. 

The fertility-restorer gene, the male-sterility gene, or the first or second marker 
10 gene in accordance with the present invention are generally foreign DNAs, preferably 
foreign chimeric DNA. In this regard "foreign" and "chimeric" with regard to such 
0 DNAs have the same meanings as described in EP 0,344,029 and EP 0,412,91 1. 

at 

SI The cell of a plant, particularly a plant capable of being infected with 

y| "1 5 Agrobacterium such as most dicotyledonous plants (e.g. Brassica napus ^ and some 
l M monocotyledonous plants, can be transformed using a vector that is a disarmed Ti- 

b plasmid containing the male-sterility gene or the fertility restorer gene and carried by 

p Aarobacterium. This transformation can be carried out using the procedures 

O described, for example, in EP 0,116,718 and EP 0,270,822. Preferred Ti-plasmid 

© 20 vectors contain the foreign DNA between the border sequences, or at least located 
to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, 
other types of vectors can be used to transform the plant cell, using procedures such 
as direct gene transfer (as described, for example, in EP 0,233,247), pollen mediated 
transformation (as described, for example, in EP 0,270,356, PCT patent publication 
25 "WO" 85/01 856, and US patent 4,684,61 1 ), plant RNA virus-mediated transformation 
(as described, for example, in EP 0,067,553 and US patent 4,407,956) and liposome- 
mediated transformation (as described, for example, in US patent 4,536,475). Cells 
of monocotyledonous plants such as the major cereals including com, rice, wheat, 
barley, and rye, can be transformed (e.g. by electroporation) using wounded or 
30 enzyme-degraded intact tissues capable of forming compact embryogenic callus 
(such as immature embryos in com), or the embryogenic callus (such as type I callus 
in com) obtained thereof, as described in WO 92/09696. In case the plant to be 



WO 98/10081 PCT/EP97/04739 

- 19- 

transformed is com, other recently developed methods can also be used such as, for 
example, the method described for certain lines of com by Fromm et al., 1990, 
Biotechnology 8:833; Gordon-Kamm et aL, 1990, Bio/Technology 2:603 and Gould 
et al., 1 991 , Plant Physiol. 95:426. In case the plant to be transformed is rice, recently 
developed methods can also be used such as, for example, the method described 
for certain lines of rice by Shimamoto et al., 1989, Nature 338:274; Datta et al. f 1990, 
Bio/Technology 8:736; and Hayashimoto et al., 1990, Plant Physiol. 93:857. 

The transformed cell can be regenerated into a mature plant and the resulting 
transformed plant can be used in a conventional breeding scheme to produce more 
transformed plants with the same characteristics or to introduce the male-sterility 
gene, or the fertility-restorer gene in other varieties of the same related plant species. 
Seeds obtained from the transformed plants contain the chimeric gene(s) of this 
invention as a stable genomic insert. Thus the male-sterility gene, or the fertility- 
restorer gene of this invention when introduced into a particular line or variety of a 
plant species can always be introduced into any other line or variety by backcrossing. 

The above-described method for reducing the AT content of a coding DNA 
while optimizing the codon usage of that coding DNA for a series of of plant species, 
as used for preparing the synthetic barstar DNAs of this invention, can of course also 
be applied to any coding sequence for which optimal expression is desired in a 
number of plant species. In this regard, the method can for example be applied to 
genes from Bacillus th uringiensis that encode insecticidal proteins, such as the full- 
length and truncated Crylab and Cry9C genes (EP 0,193,259; EP 0,654,075). 

Unless otherwise indicated all experimental procedures for manipulating 
recombinant DNA were carried out by the standardized procedures described in 
Sambrook et al., 1 989, "Molecular Cloning: a Laboratory Manual", Cold Spring Harbor 
Laboratory, and Ausubel et al, 1994, "Current Protocols in Molecular Biology", John 
Wiley & Sons. 

The polymerase chain reactions ("PCR") were used to clone and/or amplify DNA 
fragments. PCR with overlap extension was used in order to construct chimeric gen s 
(Horton et al, 1989, Gene 77:61-68; Ho et al, 1989, Gene 77:51-59). 



WO 98/10081 ' PCT/EP97/04739 

-20- 

All PCR reactions were performed under conventional conditions using the Vent™ 
polymerase (Cat. No. 254L - Biolabs New England, Beverley, MA 01915, U.S.A.) 
isolated from Thermococcus litoralis (Neuner et al., 1990, Arch. Microbiol. 1 53 :205- 
207). Oligonucleotides were designed according to known rules as outlined for 
example by Kramer and Fritz (1987, Methods in Enzymology 154:350), and 
synthesized by the phosphoramidite method (Beaucage and Caruthers, 1981, 
Tetrahedron Letters 22:1859) on an applied Biosystems 380A DNA synthesizer 
(Applied Biosystems B.V., Maarssen, Netherlands). 

In the description and in the following examples, reference is made to the 
following sequence listing and figures: 



Seauence Listina 




SEQ ID No 1 


: wild type barstar DNA 


SEQ ID No 2 


: wild type barstar 


SEQ ID No 3 


: synthetic barstar DNA 


SEQ ID No 4 


: improved barstar 


SEQ ID No 5 


: plasmid pMV71 


SEQ ID No 6 


: relevant part of plasmid pLH43 


SEQ ID No 7 


: T-DNA of plasmid pTTS24 


SEQ ID No 8 


: oligonucleotide CASOLX1 


SEQ ID No 9 


: oligonucleotide CASOLX2 


SEQ ID No 10 


: plasmid pLH48 



When making reference to nucleotide or amino acid sequences, it should be 
understood that a sequence between position X and Y (or alternatively a sequence 
from position X to position Y) is a sequence that includes the residues at position X 
and Y. 

Functional DNA elements are designated using the abbreviations as used in th 
sequence listing. 
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Example 1 : Preparati on of an improved barstar DNA. 

An improved barstar DNA was prepared by inserting by site directed mutagenesis the 
Alanine codon GCC between the first codon (ATG) and the second codon (AAA) of 
wild-type barstar. The improved barstar DNA thus encodes an improved barstar 
which, when compared to wild-type barstar has an altered N-terminus which consists 
of Met-Ala-Lys (instead of Met-Lys in wild-type barstar). This nonconservative 
modification at the N-terminus does not affect the biological activity of the improved 
barstar in plant cells (see Example 6). 

Example 2 : Design and preparation of a synthetic barstar DNA 

A synthetic barstar DNA encoding the improved barstar of Example 1 was designed 

according to the following criteria: 

the percent A and T nucleotides in the synthetic barstar should be below 40% 
(wild-type barstar DNA has a %AT of 51.6). AT rich genes have a higher 
likelihood of including polyadenylation signals and intron recognition 
sequences which can prevent or decrease expression of especially long 
coding sequences (e.g. Bt coding sequences) in plant cells, 
despite the increase of C and G nucleotides the number of CG dinucleotides 
and CNG trinucleotides should remain as low as possible. This is because CG 
dinucleotides and CNG trinucleotides are targets for methylation which can 
inactivate a gene in plant cells. 

the codon usage should be as optimal as possible in a wide range of plant 
species, particularly in oilseed rape, cotton, corn, rice and wheat. Therefore 
for each amino acid a codon (or codons) were selected for predominant use 
in the barstar DNA that in the majority of plant species has a frequency which 
is at least more than twice the frequency of the least used codon and/or are 
at least more than half the frequency of the most used codon. Thus, for each 
plant species and for each amino acid a list of codons is made which have a 
. frequency which is at least more than twice the frequency of the least us d 
codon in that plant species and/or are at least more than half the frequency 
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of the most used codon in that plant species (codons complying with this 
criterium are designated as optimal codons for that plant species). The codon 
that is an optimal codon in the majority of plant species is taken as th 
predominant codon, preferably the only codon, for that amino acid in th 
5 synthetic barstarDNA. 

Conveniently, the frequencies of codons used in this analysis are those listed 
in Ikemura ( supra ). 

avoid stretches of longer than 4 identical nucleotides. 

10 A synthetic barstar DNA with the nucleotide sequence of SEQ ID No. 3 was 

thus obtained. This synthetic barstar DNA has an %AT of 38.4, and nevertheless 
contains only 16 CG dinucleotides and 24 CNG trinucleotides. The synthetic barstar 
DNA does not contain potential plant polyadenylation signals or intron splice sites, 
w The codon usage of the synthetic barstar DNA appears suitable for expression in at 

ry 15 least oilseed rape, cotton, com, rice and wheat (Table 1). For the 18 amino acids for 
[** which multiple codons exist 17 amino acids are predominantly encoded (and 16 are 

H exclusively encoded) in the synthetic barstar DNA by a codon which is an optimal 

codon in each of these five plant species. In fact al! the amino acids for which 
multiple codons exist are predominantly encoded in the synthetic barstar DNA by an 
20 optimal codon for at least three of the five species. 

The synthetic barstar DNA contains only 1 CCCCC and 1 GGGG stretch. 

The synthetic barstar of SEQ ID No 3 allows use of the following unique 
restriction sites for cloning purposes: Ncol, BspE1 and Kasl. 

25 Example 3 : Expression of improved barstar DNAs in com cells 

Cultured Black Mexican Sweet (BMS) com cells were bombarded with th 
Bio-Rad PDS-1000/HE particle gun (Bio-Rad, Laboratories, Hercules, California, 
U.S.A.) using microprojectiles (gold particles) coated with either plasmid pLH43 or 
with plasmid pMV71 as follows. 
30 BMS suspension cells were placed on appropriate filters and bombarded by 
conventional procedures (Kirihara, 1994, In 'The Maize Handbook", Freeiing and 
Walbot, eds. f Springer Verlag, pp 690-694), modified by including an osmotic 
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pretreatment (Russel et ai, 1992, In Vitro 28:97-105). The gold particles were coated 
with a mixture of the appropriate plasmid DNA containing the barstar DNA and 
plasmid pAct1-D (McElroy et al, 1990, The Plant Cell 2:163-171) in a ratio 5:1 using 
the procedures described in the suppliers manual. pAct1-D is a plasmid containing 
5 the gus gene (which encodes beta-glucuronidase) under the control of the promot r 
of the rice actin gene which has essentially the sequence as disclosed in SEQ ID No 
5 from position 1 999 to 3400. 

For each plasmid two to three filters were bombarded. 24 hours after bombardment, 
the cells from each filter were collected and ruptured by grinding in liquid nitrogen. 
1 0 The ruptured cells for each filter were divided in two equal amounts, one for a barstar 
assay and one for a GUS assay. 

From one half of the ruptured cells from each bombarded filter, total soluble cellular 
protein was extracted in extraction buffer (50 mM Tris/HCI pH 7.5, 5% glycerol, 100 
mM KCI, 1 mM benzamidin.HCI, 5mM e-amino-n-caproic acid, 10 mM EDTA, 10 mM 

15 EGTA, 1 |jg/ml antipain, 1 pg/rnl leupeptin, 14 mM beta-mercaptoethanol, 1.5% 
polyvinylpyrrolidone, 1 mM PMSF). Protein concentration was measured by the 
Bio-Rad Bradford assay. Per sample 50 pg of proteins were loaded on an 1 8% SDS- 
polyacrylamide gel and separated by electrophoresis. 0.1, 0.5, 1, 2.5 and 5 ng of 
purified barstar were loaded on the gel as positive controls. 

20 Proteins were then electroblotted onto Hybond C using TGM-buffer (25 mM Tris pH 
8.3, 192 mM glycine, 20% v/v methanol) for 2 hours at 60 V. The filters were then 
incubated first for 2 hours in 0.5% PBS-Tween 20 and then overnight in a solution of 
primary antibodies (1/1000 dilution of polyclonal rabbit IgG anti-barstar antibody in 
PBS- 0,5% Tween 20). The filters were then washed four times for 5 minutes in PBS 

25 and then incubated for 1 hour in a solution of donkey anti-rabbit-IgG, horseradish 
peroxidase linked (Amersham, Buckinghamshire, Great Britain) 1/1000 dilution in 
PBS- 0.5% Tween 20). The filters were then again washed four times for 5 minutes, 
in PBS and proteins were then detected using the ECL detection system 
(Amersham). The amount of barstar was measured by densitometry scanning of the 

30 autoradiographs. 

In parallel, activity of beta-glucuronidase in proteins extracted from the other half of 
ruptured cells from each bombarded filter was measured by a fluorogenic assay 
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(Jefferson, 1987, Plant MoL Biol. Reporter 5:387-405). 

Barstar production from a particular plasmid was determined by comparing the 
amount of barstar per (arbitrary) unit GUS activity. 

pMV71 is a plasmid having the nucleotide sequence of SEQ ID No. 5 and 
contains the improved barstar DNA of Example 1. This variant barstar DNA is 
operably linked to the rice actin promoter and the 3' untranslated end of the nopaline 
synthase gene of Agrobacterium T-DNA. 

pLH43 is a plasmid having the nucleotide sequence of SEQ ID No. 5 in which 
the sequence between nucleotides 3399 and 4021 is replaced with the nucleotide 
sequence of SEQ ID No. 6. pLH43 contains the synthetic barstar DNA of Example 
2 (SEQ ID No. 3) operably linked to the rice actin promoter and the 3' untranslated 
end of the nopaline synthase gene of Agrobacterium T-DNA. 

pTS410 is a plasmid having the nucleotide sequence of SEQ ID No 5 in which 
nucleotides 3404 to 3406 (i.e. the second codon of the barstar DNA in pMV71) have 
been deleted. 

The results of the above experiment is presented in Table 2 which shows 
measurements from individual bombardments. It can easily be seen that the 
introduction of the improved barstar DNAs of pMV71 and pLH43 result on the 
average in a significantly higher production of (improved) barstar protein when 
compared to the production of (wild-type) barstar protein after introduction of the wild- 
type barstar DNA of pTS410. In addition it can be seen that the introduction of the 
improved synthetic barstar DNA of pLH43 in com cells results in a significantly higher 
production of (improved) barstar protein in those cells when compared to the 
production of (also improved) barstar protein after introduction of the improved 
barstar DNA of pMV71 . 

Example 4 : Expressi on of improved barstar DNAs in rice plants 

Four transgenic male-sterile rice lines of cultivar Kochihibiki were obtained, 
using plasmid pTS172, essentially as described in WO 92/13956. Plasmid pTS172 
contains the following chimeric genes: P35S-bar-3'g7 and PE1- barnase -3'nos and 
can be obtained from pTS173 (see below) by ligating the large fragment of pTS173, 
digested with BstEII and Mscl to the small BstEII-MscI fragment of pJVR2-E1. 
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These lines were designated as K104, K107, K109 and K111. 

Seven transgenic male fertile restorer rice plants of cultivar Chiyonishlki were 
obtained essentially as described in WO 92/13956 using plasmid pTS173 which 
contains the following chimeric genes: P35S-i2ar-3'g7 and PE1 - wild-type bflf gtar- 
3'nos. pTS173 is derive! from pJVR3-E1 (WO 92/13956) by replacing the 35S 
promoter and the 3' untranslated end of the chimeric hsi gene of pJVR3-E1 by the 
35S promoter and the 3' untranslated end of the chimeric bar gene of pTTS24 as 
follows. From the T-DNA insert dt plasmid pTTS24 (SEQ ID No. 7) a DNA fragment 
containing the 3' end of T-DNA gene 7 and part of the bar gene is amplified by PCR 
using the oligonucleotide primers CASOLX1 (SEQ ID No 8), which overlaps the Kpnl 
site in the bar gene, and CASOLX2 (SEQ ID No 9). The PCR product is cleaved with 
Aatll and Kpnl, and ligated to the large fragment of plasmid pJVR3-E1 cleaved with 
Aatll and Kpnl. From the obtained plasmid the smaller Ncol+Notl fragment, 
(containing P35S) is replaced by the corresponding Ncol-Notl fragment from pTTS24 
(positions 880 to 2281 in SEQ ID No 7), resultiNgjn pTS173. 
The resulting lines were designated as C111, C1 13, C117, C1 18, C120, C121 and 
C125. All these plants thus contain the wild type barstar DNA of SEQ ID No 1 under 
control of the E1 promoter (WO 92/13956). 

32 additional restorer plants were obtained using Agrobacterium -mediated 
transformation of wounded compact embryogenic callus (obtained from rice immature 
embryos of cultivar Kochihibiki)(WO 92/09696). The Aarobacterium strain used for 
transformation was strain Ach5 (Genetello et al, 1977, Nature 265:561-563) cur d 
from its wild-type Ti-plasmid and containing plasmids pGV4000 and pTTS24. 
pTTS24 is an intermediate cloning vector which resembles pGSC1700 (Comelissen 
and Vandewiele, 1989, NAR 17:19-29) in its essential characteristics but primarily 
differs from that plasmid by lacking the beta-lactamase gene and by containing a T- 
DNA having the nucleotide sequence of SEQ ID No 7. pGV4000 is a disarmed Ti- 
plasmid which is derived from pMP90 (Koncz and Schell, 1986, Mol.Gen.Gen t. 
204:383-396) and which contains a region which allows homologous recombination 
with a corresponding region in pTTS24. 

These plants, which all contain the synthetic barstar DNA of Example 2 (SEQ ID No 
3) under control of the E1 promoter, were designated with the "OSC" numbers as 
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The amount of barstar protein was determined in all restorer lines. Per line one 
to two immature panicles (i.e. panicles in which the majority of spikelets are 
approximately 3.5 to 4.5 mm long) of a single plant were crushed in liquid nitrogen. 
Proteins were extracted and detected as described in Example 3. 

The results are summarized in Table 3. It can be seen that the restorer lines 
containing the barstar DNA of SEQ ID No 3 produce significantly more barstar than 
those lines that contain the barstar DNA of SEQ ID No 1 . 

It was confirmed that the amount of barstar mRNA was directly related to the 
amount of barstar protein in the panicles (data not shown). It was further confirmed 
that the production of barstar protein is effectively restricted to the flowers of the 
panicles (data not shown). 

Selected restorer lines were crossed with the four male-sterile lines. It was 
observed that restorer capacity of the restorer lines was correlated with the amount 
of barstar protein produced in the flowers. Lines producing between 4 and 1 0 ng 
barstar per mg of extracted protein were observed to be able to partially restore 
microspore development to male-sterile lines in the sense that the "restored plants" 
produced nonviable pollen (male-sterile plants do not produce pollen at all). It was 
observed that restorer lines producing 50 ng or more barstar per mg of extracted 
protein were able to fully restore the fertility to all tested male-sterile lines, i.e. all 
restored progeny plants produced fully viable and functional pollen, and normal 
seedset after selfpollination. 

Selected restorer lines were also self-pollinated and immature panicles w re 
harvested from the transgenic progeny plants. The expression level, as determined 
by the amount of barstar produced, was at least equal to, and in many plants greater 
than, that determined in the primary transformants. Generally, these observations 
confirm that barstar expression level is stably transmitted to progeny plants. 
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Examole 5 : Expression of improved barstar DNAs in corn plants. 

A transgenic male-sterile com line, designated as MS3, was obtained 
essentially as described in WO 92/13956 using plasmid pVE108. 

Several transgenic male fertile restorer com plants, containing the wild-type 
barstar DNA of SEQ ID No 1 under the control of either the TA29 promoter (EP 
344,029) or the CA55 promoter (WO 92/13957) were obtained essentially as 
described in WO 95/34634. In particular seven restorer lines were obtained by 
transformation of com with plasmids pCOUOO and pDE110 (WO 92/34634) which 
contains inter alia the TA29 promoter operably linked to the barstar DNA as 
described in SEQ ID No 1. When crossed to MS3 plants, restoration was incomplete 
in that maximally 75% of the anthers produced viable pollen (this shows that the MS3 
line is especially difficult to restore as good restoration was obtained with other male 
sterile lines - data not shown). 

Similarly, six transgenic restorer com lines were obtained by transformation 
of corn with plasmid pLH48 (together with plasmids selected from pCOL9S or PLH52 
or p35S-Bperu or pCOL1 1 (WO 92/34634)). These restorer lines all contain at least 
the synthetic barstar DNA of Example 2 (SEQ ID No 3) under control of the TA29 
promoter. The six restorer lines were crossed to MS3 plants and for four restorer lines 
complete restoration was observed. One of these four restorer lines was designated 
as RZM583-0101. 

The transgenic line MS3 was pollinated by transgenic line RZM583-0101, and 
progeny plants containing both the chimeric bamase gene and the chimeric barstar 
gene were identified by PCR screening. Those fertility restored progeny plants w re 
used to pollinate silks of MS3 plants. Among 90 progeny plants of this last cross, 15 
plants were identified (by quantitative Southern blotting) to be homozygous for the 
chimeric bamase gene of MS3. Among those, plants that did not contain the chimeric 
barstar gene of RZM583-0101 were male-sterile, while those plants that contained 
the chimeric barstar gene of RZM583-0101 were fully male fertile. This demonstrat s 
that fertility of plants containing a chimeric barnase gene in homozygous condition 
can be fully restored by a chimeric barstar gene comprising the improved synth tic 
barstar DNA. 

Thus it can be concluded that the restorer lines containing the improved 
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synthetic barstar DNA have a significantly better restorer capacity. 

The amount of barstar in immature tassels (comprising microspores in the 
uninucleate stage) of progeny restorer plants was determined essentially as 
described in Example 3. It was observed that the amount of barstar protein was 
significantly higher in the tassels of restorer plants containing the improved synthetic 
barstar DNA than in those containing the wild-type barstar DNA. For example in a 
plant containing the wild-type barstar DNA, the amount of barstar protein was 
determined to be below 20 ng barstar/mg total protein. In contrast in two lines 
containing the improved synthetic barstar DNA the amount of barstar protein in 
immature tassels was determined to be respectively 210 and 100 ng barstar/mg total 
protein. 

Example 6 : Activity of wild type Barstar and improve d barstar when produced in 
stamen cells of oilseed rape. 

Oilseed rape plants (cv. Drakkar) were transformed using Agrobacterium- 
mediated transformation, with plasmids pTHW118 or pTTS139. Plasmid pTHW118 
contains the wild-type barstar DNA (SEQ ID No 1) under the control of the TA29 
promoter. Plasmid pTTS139 contains an improved barstar DNA containing an 
additional GCC alanine codon between the first and second codon of wild-type 
barstar DNA. Thus the barstar DNA in pTTS139 encodes the improved barstar 
protein of SEQ ID No 4. 

Two oilseed rape lines transformed with pTHW1 18, respectively designated 
as DBN366-0011 and DBN366-0029, and two oilseed rape lines transformed with 
pTTS139, respectively designated as DBN342-1010 and DBN367-0035 were 
selected for quantitative analysis. 

Oilseed rape flower buds with a length of about 3 to 4 mm were isolated from 
the plants and were crushed in liquid nitrogen. Proteins were extracted, and the 
amount of total extracted protein, as well as the amount of barstar detectable by 
Western blot, was determined, all essentially as described in Example 3. 
Table 4 presents the amount of total extractable protein (column 4) and the amount 
of barstar (column 2) found in each line. 
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Barstar activity was assayed essentially as described in Fitzgerald and Hartley 
(1993, supra ). To 600 \i\ TE buffer (10 mM Tris/HCI pH 8.0; 1 mM EDTA) was added 
20 (Jl of a 0.02 M NH4 acetate pH 8.0 solution containing 10 pg/ml bovine 
serum albumin and 0.1 pg/ml barnase, 
5 - "x" n I of a 1 in 5 dilution of proteins extracted from the flower buds ("x" is 0, 4, 
8, 12, 16, 20 respectively). 

After mixing and waiting for about 1 minute, 6 \i\ of a 1 in 10 dilution of a stock 
solution containing 0.4 mg/ml polyethenoadenosinephosphate in TE buffer was 
added. 

10 The final solution was mixed and transferred immediately to a cuvette and the 
increase in fluorescence was recorded for one to two minutes. 

The values of the initial increase in fluorescence/minute was plotted against 
"x" (the volume of barstar containing solution). In each case a linear relationship was 
obtained, as expected. The intercept of the regression line with the X-axis was 
CS 15 calculated (Table 4, column 3): this represents the volume of Y which contains 
sufficient barstar to completely neutralize 2 ng of barnase (i.e. the volume containing 
^ a molar amount of barstar which is equivalent with 2 ng barnase). From this the 

as 

amount of "active" barstar in the flower buds (Table 4, column 5), as well as the 
lJ proportion of "active" barstar to total barstar protein detected by Western blot (Table 



fcsf 

y| 20 4, column 6)), can be determined. 

Isstf* 



From these ratios it can be deduced that the activity of improved barstar is at 
least equivalent with that of wild-type barstar. However, in lines transformed with 
pTHW1 1 8 (expressing wild type barstar) the average ratio is 0.27 (standard deviation 
25 0.08), while in lines transformed with pTTS139 the average ratio is 0.39 (standard 
deviation 0.08). The difference between the average ratios for the two types of lines 
is statistically significant (t = -26, p< 0.005). From this it can be concluded that 
improved barstar, when produced in oilseed rape flower buds will generally result in 
a higher level of active barstar as compared to wild type barstar. 

30 



All publications cited in this application are hereby incorporated by r ference. 



WO 98/10081 



-30 



PCT/EP97/04739 



Table 1 



Amino 
Acid D) 


Barstar 
codon c) 


Nr*> 


Plant a) 


Oilseed 


Cotton 


Maize 


Rice 


Wheat 


I 1 


CTC 


Q 


+ 


+ 


+ 




+ 


CTT 


O 


+ 


+ 


+ 


+ 


+ 


TTG 


I 




+ 




+ 


+ 


Ser 






+ 


+ 


+ 


+ 


+ 


TPP 


I 


+ 




+ 


+ 


+ 


Arg 


MOO 


Q 
O 


+ 


+ 




+ 




Thr 
1 1 if 




A 


+ 


+ 


+ 


+ 


+ 


Pro 


nrr 

www 


1 


+ 


- 




+ 


- 


CCG 


1 
I 


+ 


- 


+ 




- 


Ala 


\J w w 








+ 


+ 




GCT 


1 

1 


+ 


+ 


+ 


+ 




Glv 


GGC 




- 




+ 


+ 


+ 


GGT 


1 

1 


+ 


+ 


- 


- 


- 


GGG 


1 


+ 


+ 


- 


- 


+ 


Val 


GTG 


4 


+ 


+ 


+ 


+ 




GTC 


1 


+ 


+ 


+ 


+ 


+ 


Lvs 


AAG 


6 






+ 


+ 


+ 


Asn 


AAC 


3 


+ 


+ 




+ 


+ 


Gin 


CAG 


5 




+ 


+ 




+ 




CAA 


1 


+ 


+ 


+ 


+ 


+ 


His 


CAC 


1 

I 


+ 


+ 


+ 


+ 


+ 


Glu 


GAG 


1 1 

1 1 


+ 




+ 


+ 


+ 


Asp 


GAC 


4 


+ 


+ 


+ 




+ 


Tyr 


TAC 


3 




+ 


+ 


+ 


+ 


Cys 


TGC 


2 


+ 


+ 


+ 




+ 


Phe 


TTC 


2 


+ 


+ 


+ 


+ 


+ 


lie 


ATC 


6 


+ 


+ 


+ 


+ 





b) 
c) 
d) 



■ t. .w uvi^wi^u piciiii, !■ i w owuiSJl JO ail wpillJICU UUUUi I III 

that plant i.e. is a codon that has in that plant a frequency which is at 
least twice the frequency of the least frequent codon in that plant 
and/or which is at least 50% of the frequency of the most used codon 
in that plant. Frequency of codons in the plant species are those list d 
(in per thousand) by Ikemura ( supra ) 
- : indicates that, in the selected plant, the codon is not an optimal codon 

in that plant as defined above 
amino acid for which multiple codons exist (not Trp, Met) 
codon used in the synthetic barstar DNA of SEQ ID No 3 for the amino acid 
number of amino acids encoded by this codon in SEQ ID No. 3 
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Table 2 



piasrniu 


Piltor 

Nr 


ny Darsiar 
protein 8 ' 


bus 
activity"' 


correcieo 
barstar^ 


Mean 






1 


2.8 


0.6 


4.7 






pLH43 


2 


3.7 


0.6 


6.2 


5.4 


0.8 




3 


2.7 


0.5 


5.4 








1 


2.4 


0.8 


3.0 






pMV71 


2 


3.6 


1.3 


2.8 


2.7 


0.4 




3 


1.1 


0.5 


2.2 






pTS410 


1 


0.6 


0.8 


0.8 






2 


0.8 


0.8 


1.0 


0.9 


0.1 



a) Amount of barstar/50 pg loaded protein as determined by Western blot 

b) Gus activity is in arbitrary units. 

C) Corrected barstar: ng barstar / units of GUS activity. 
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Table 3 





i\ir 


Irlasmid 


ILinp 


I nn har.Stpr/m^ tntal protein 




•4 
1 


pTS 173 


C1 11 


<2 a) 




2 




C1 13 


4 


5 


3 




C1 17 


<2 ' 




4 




C118 


2 




5 




C120 


6 




6 




C121 


2 




7 




C125 


4 


10 


8 


pTTS24 


OSC1156 


21 




9 




OSC1160 


36 




10 




OSC1178 


<12 a) 




1 1 




OSC1181 


39 




12 




OSC1185 


<12 a) 


15 


13 




OSC1187 


29 




14 




OSC1188 


78 




15 




OSC1208 


22 




16 




OSC1210 


<12 a) 




17 




OSC1212 


69 


20 


18 




OSC1219 


72 




19 




OSC1226 


<12 a) 




20 




OSC1229 


42 




21 




OSC1235 


<12 a) 




22 




OSC1237 


20 


25 


23 




OSC1239 


26 




24 




OSC1241 


<12 a) 




25 




OSC1242 


43 




26 




OSC1247 


84 




27 




OSC1249 


42 


30 


28 




OSC1251 


65 




29 




OSC1254 


185 




30 




OSC1258 


53 




31 




OSC1268 


130 




32 




OSC1271 


75 


35 


33 




OSC1272 


30 




34 




0SC1274 


138 




35 




OSC1276 


<12 a) 




36 




OSC1278 


86 




37 


< 


DSC1287 


26 


40 . 


38 




DSC1288 


32 




39 


( 


DSC1293 


<12 a) 



a) below detection limit 
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